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Abstract 

The strong subadditivity of entropy plays a key role in several ar- 
eas of physics and mathematics. It states that the entropy S[g\ = 
—Tr(^g\ng^ of a density matrix ^3123 on the product of three Hilbert 
spaces satisfies S'[^3i23] — 5'[£i23] < S[gi2] — S[g2]- We strengthen this 
to 5'[gi23] — 'S'[^?i2] < X]q '^"('^'[£'23] ~ '5'[£?2])) where the are weights 
and the f?23 partitions of ^>23- Correspondingly, there is a strength- 
ening of the theorem that the map A i—f Trexp[L -I- In A] is concave. 
As applications we prove some monotonicity and convexity properties 
of the Wehrl entropy and entropy inequalities for quantum gases. 



1 Introduction 



The strong subadditivity of entropy (SSA), whose proof in the non-com- 
mutative case was given by Lieb and Ruskai ^ |2] , is one of the main in- 
gredients in various fields of mathematics and physics in which the von 
Neumann/Shannon entropy plays a role. Over the years other proofs have 
appeared SSA is an inequality among various entropies that can 
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be formed from one density matrix on the product of three Hilbert spaces 
"^123 = 7i2 'H3 and states that 

S[gi23] - S[gi2] < S[q23] - S[q2] . (1) 

Here, gi23 is a density matrix (i.e., a positive semi-definite operator whose 
trace is 1) on the tensor product space "^123, and gu is the reduced density 
matrix on 7ii2 = TLi Ti.2, formed by taking the partial trace of ^^123 over 
H3 (i.e., gi2 = Tr>{3^i23), and so forth. The entropy S[g] on a Hilbert space 
7i is given by the von Neumann/Shannon formula 

S[g] = -TTn{glng). (2) 

(Henceforth, the Hilbert space notation TC on the trace, Tr , will be omitted 
if it is not needed, or we may simply write Tri to denote Tr^^^ , etc.; likewise, 
5*123 will denote 5'[£)i23] , etc., when the meaning is clear.) 

Inequality appears to be straightforward enough that it seems un- 
likely that it can be improved, i.e., that one can insert a quantity between 
the left and right sides that preserves the inequality. That, however, is what 
we do in this paper (cf. Eq. @). Admittedly, our theorems can be derived 
from SSA (or, equivalently [HI, from the monotonicity of relative entropy 
under completely positive trace preserving (CPT) maps) and thus, when 
viewed from a sufficiently remote perspective, there is little new here. From 
the point of view of applications and of understanding the potential of SSA, 
however, our results and proof techniques may merit attention, especially 
our applications to the theory of quantum gases in CorollaryEJof Section HOI 

Inequality ^ is written in a slightly unusual way. Instead of the usual 
5*123 + 5*2 < 5*12 + 5*23 , © displays the inequality as the decrease of the 
conditional entropy 5*23 — 52 when 2 is replaced by 12, i.e., information about 
the state on Tii is added. Our focus will be on the conditional entropy. 

In Section |31 we give some examples of the utility of the improved version 
of inequality Eq. As one example, we show that the 'mutual 

information' 5i + 5*2 — 5*12 is decreased if the density matrix is replaced by 
Wehrl's corresponding classical phase-space function (whose definition will 
be recalled later). Wehrl had shown [2j that his entropy is always greater 
than the true entropy, but the monotonicity of the difference 5*i2 — 5*1 — 52 
is new. This is a special case of Corollary 21 below. We also show that the 
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difference between the Wehrl and the true entropy is a convex function of 
the density matrix. 

Originally, we had proved the monotonicity of S12 — Si , and we are grate- 
ful to M.B. Ruskai for suggesting the stronger version to us; her argument, 
which uses the theory of CPT maps, is briefly sketched in Appendix 10 [S]. 
We also acknowledge other helpful correspondence about this paper. 

In another direction, it will be recalled that one of the ways to prove 
SSA is by means of the theorem [Sj (for one Hilbert space) that the map 



for positive definite operators A is concave for each fixed self-adjoint L. 
This, too, will be improved, and its improvement will lead to the improved 
version of SSA. 

Our main result is the following. 

Theorem 1 (Stronger Subadditivity). LetTCi, i = 1;2,3, be separable 
Hubert spaces, and let Q123 be a density matrix on ^123 = 'Hi®'H2®T~(-3 with 
finite entropy. Let Q be a measure space, with elements labeled by a, and let 
dfi{a) be a measure on $7. Let be bounded operators on 7ii2 = ® 'H2 
that are weakly measurable, and satisfy ( with K"^* the adjoint of K") 



A^Ti exp (L + In A) 



(3) 




(4) 



With the usual notational abuse (g) 17^3, let 



n' 



a 



Tri23K''Qi23K' 



a* 



(5) 



and, in case > 0, let 



Q23 



a 



(6) 
(7) 




Tri3i^"£>123i^"7n' 



a 



Then 



S[qi23] - S[0i2] < [ dfi{a)n''{S[0^,] - S[0^]) 



(8) 
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Remarks. 



2. 
3. 

4. 

5. 



We recall that weakly measurable means that {4>\K°'\'ip) is measurable 
for any vectors and in the Hilbert space. (This is implied, via 
polarization, by the seemingly weaker condition of measurability for 
all = IV')-) The integrals then have to be interpreted in the weak 
sense, e.g., © means that J^d^i{a) {(l)\K°'* K"-\ip) = ((/>!'(/') for all 
and IV")- 

Because of cyclicity of the trace, = Tr if"*iir"^3i23, and hence ^ 
implies that dfi{a) = Tr Q123 = 1. 

Both sides of the inequality © are homogeneous of order 1 in ^3123. 
Hence this inequality holds also without the normalization condition 
Tr^>i23 = 1, i.e., it holds for all positive trace class operators. 

It is no restriction to assume the K'^ to be bounded. Because of (jlj), 
they must be bounded almost everywhere, and hence one can absorb 
their norm into the measure d^{a). 

In case all the act non-trivially only on 7ii (i.e., if" = k"' and 
diJ,{a) k"^* = Ifi-^) we have that j^dfj.{a) Q22, = Q23- Since the 
map ^323 > 'S'[^>23] — S[q2] is concave, as shown in 1,, the right side of 
(jHl) is bounded above by 5'[^>23] — S[g2] in this special case. Theorem^ 
is, therefore, stronger than the usual strong subadditivity of entropy 
because we have 



in this case. 

Everything remains true in the classical case as well. That is, TCi is 
replaced by a measure space, g is replaced by a measurable function 
and the trace is replaced by an integral. The K°' are then functions 
on the measure space. Note that in the limit that K"^*K" is just a 
(5-function supported at a point in the measure space, labeled by a. 
Inequality Q is actually an equality in the classical case. 




(9) 
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1.1 The special case of matrices and sums 

To keep things simple we shall first deal with the finite dimensional case, 
when Tii = C""' for finite rii, and with the case where the integral in Q is 
just a finite sum. In this special case, Theorem ^ is then just Theorem |21 
below. We will first prove Theorem|21 The extension to the case of a general 
measure space in @ is given in Appendix^ and the extension to the infinite 
dimensional case is given in Appendix IbI 

Theorem 2 (Stronger Subadditivity, Matrix Case). Let gi23 be a 
density matrix on a finite dimensional Hilhert space 7^123 = 'Hi ® 7i2 ® Ti-s- 
Let K°^, 1 < a < M , he a finite set of operators on 7ii2 = Ti-i 7^2; that 
satisfy 

^i^"*K° = %, . (10) 

a 

With the usual notational abuse ® I-^^g , let 

n" = Tri23i^>i23i^"* (11) 

and, in case n" > 0, let 

^f3=TYiK>i23i^"7n" , (12) 
= Its Q% = Tri3 i^>i23^°7™° • (13) 

Then 

S[Q12,] - S[ei2\ < ^^"(^[^23] - S[q^]). (14) 

a 

We will give two independent proofs of Theorem [21 in the next section. 
The first one uses Theorem El below, which states the generalization of the 
concavity in Q mentioned above. The second proof uses the theory of 
completely positive maps. 

Theorem 3 (A concave map). Let L he a self-adjoint operator on a finite 
dimensional Hilbert space 7i. For 1 < a < M , let A" be positive operators 
on 7i and let be operators such that K"* K'^ < I-}^. Then the map 

{A\...,A^^) ^Trnew + i^"*(ln^")i^"^ (15) 
is jointly concave. 



a=l 
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Remarks. 

1. This theorem was proved in Thm. 6] for one A and = I, and 
it was generahzed there, in Corollary 6.1, to A^, . . . , A*^, but only in 
the case that the K" are nonnegative numbers with < 1 . 

2. Theorem |21 can be extended to the infinite dimensional case as well, 
using the methods of Sect. 4 in 9 . It is also possible to generalize 
to the case of continuous variables a in some measure space in 
this case. A" is a measurable function on with values in the posi- 
tive operators, and the sum over a in (|15|) is replaced by the integral 

dn{a) iC"*(lny4")Er", with i^T" satisfying dti{a) K'^*K°' < I. For 
simplicity we will not give this generalization here since we will not 
need it for the proof of our main Theorem ^ 

3. Note the switching of and K""* between (HTJ-lISl) and Note 
also that only the inequality J2a K"*K" < I is required for Theorem|31 
whereas equality is necessary in Theorem |2 



2 Proof of Theorems [2] and [31 

Proof of Theorem\^ We start with two preliminary remarks, (a.) It is 
clearly enough to assume that j{o:*j^^a _ £qj, otherwise we can add 
one more K^^+^ = (I - A""*-?^")^/^ and take A^^+i = I. (b.) We can 
also assume that all K'^ are invertible; the general case follows by continuity. 

Let /C = H^C*^. Every operator in B{JC) can be thought of as an M x M 
matrix (indexed by a,/5) with entries in B{H). Define L, A, P & B{1C) by 

Aaf} = 6a^pA'', (17) 

and 

Paf3 = K"Kf^*. (18) 

Note that P = P* and (since k'^*k<^ = I) P^ = P, so P is an orthogonal 
projection. We know from |31 Thm. 6] that the map 

(A\ . . . , A^^) ^ Trx; exp (-A(I - P) + L + Inl) (19) 
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is concave, for every A G R. The concavity property survives in the limit 
A ^ oo, in which Umit the operator in the exponent of (|19() is — oo on the 
orthogonal complement of PIC. Therefore, in the A — > oo limit the concavity 
in H19() becomes the statement that 



L + lnA 



P 



(20) 



is concave. We shall show that 



TVp^ exp ( P 



L + lnA 



P) = Ttn exp (^L + Y, (In j , (21) 



which finishes the proof. 

Eq. H21() can be proved as follows. The trace on the left side is over PIC, 
which is isomorphic to In fact, the map U :?{ PIC, defined by 



(22) 



is clearly onto since every vector in PIC has the form K"'^. Moreover, since 
K"*K" = I, U preserves norms and hence U is a unitary. A simple 
calculation shows that 



U*P 



L + lnA 



PU = L + ^if"*(lnA")i^". 



(23) 
□ 



Proof of Theorem using Theorem We need to show that 

TVi23 ^123 In Q123 + In Qi2 - K"* (In - In g^^) K"^ < 0. (24) 

Using the Peierls-Bogoliubov inequality jlUj . we see that (|24() holds if we 
can show that 



TVi23 exp l^ln gu - K"* (In - In g^^) K"^ < 1. 



(25) 



We now use an idea of Uhlmann Let U3 be a unitary operator on Ti^, 
and let dU^ denote the corresponding normalized Haar measure. Since the 
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trace is invariant under unitary transformations, and since the commute 
with Us, we see that the left side of equals 

j Tri23 f/| exp ^In qu - (1^ Q2 " In 0%) Us dUs = 

J Tri23 exp ^In - ^ K^* (In + E U*sQ%U^) i^"^ dU^. 

(26) 

Now / [f^3 ^23^3] — "^"^1^3 ^23 — d~^Q2^ where d denotes the dimension 
of TCs . Using the concavity result of Theorem |31 we see that 

(HH) < Tri23 exp ^In qu - ^ i^'^^i^" (In d)^ = 1. (27) 

The last equality follows from j{^a*j{a _ j ^^^^ Tri23 Q12 = d. □ 

Proof of Theorem\B using CPT theory. Consider the map ^ : Tii "^2 C3 

given for a general density matrix ^123 by 

<I>(£'i23)=0n>?3- (28) 
a 

This map is trace preserving and completely positive (CPT) jl2| . It is known 
that the relative entropy, H{g,'y) = Tr g{lng — In 7) decreases under such 
maps El ! ^nd hence 

H{0i23, Q12 ® ^3) > H{^{gi23),<P{0i2 ^•3))- (29) 

The left side of this inequality equals S'[£»i2] + SIqs] — 5'[^>i23]. To compute 
the right side, note that 

H0i2®g3) = ^n''g2^ 03- (30) 

a 

It is then easy to see that the right side of equals J2a ^"(.'^[02] "'^'[£'23] + 
5'[^3]). Thus is the same statement as (fTl)) . □ 
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3 Corollaries and Applications 

Taking Ti.2 = C, we get as an immediate corollary of Theorem ^ and the 
concavity of g S[g]: 

Corollary 1 (Improved Subadditivity). Let Q12 be a density matrix on a 
separable Hilbert space 7ii(^TC2- Let be positive, bounded and measurable 
operators on TLi, with j^diJ,{a) P" = Ii. Let n" = Tri2-P"£'i2 and, in case 
n° > 0, let = Tri P"£»i2/n°. Then 



Remarks. 

1. In the notation of Theorem ^ P" = K°'*K°', but there is no need for 
this splitting in this case. 

2. One may wonder whether (|31|) holds if gi is also split in a manner 
similar to g2- This is not true, in general! As a simple example, 
consider the case when Til = W2, and gu = ELi ® H", 
with n" being mutually orthogonal one-dimensional projections. With 

= n" we have S[gi2] = Ind, whereas S[g'^] = S[g^] = for all a. 

3.1 Classical Entropies 

Now, suppose we are given a partition of unity of both 7ii and H2, i.e., a 
finite set of positive operators and such that 



For simplicity, we restrict ourselves to the case of a finite dimensional Hilbert 
space and discrete sums in this subsection, but, using the methods described 
in the appendix, one can extend the results to the case of infinite dimensional 
(separable) Hilbert spaces and integrals over general measure spaces. 

For gi2 a density matrix on TCi 7i2, we can define a 'classical' entropy 

as 




(31) 




(32) 




(33) 
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Analogously, we can define 5*^ [fpi] and 5"^ [£»2] for density matrices on Hi or 

In the case where the and are one-dimensional projections, this 
definition agrees with the one in ^Sj. Note, however, that we define the 
classical entropy here for any partition of unity. If this partition is trivial, 
i.e., = I for a = 1 and zero otherwise, then 5"^'[£>i] = identically. In 
particular, it is in general not true that S[gi] < S'^^[qi]. However, this is true 
if the partition is such that TV < 1 for all a, in which case this inequality 
follows from concavity of x i— > — xlnx. 

Corollary n above can be used to prove the following inequality for the 
mutual information S[gi] + S[q2] — S[qi2]- 

Corollary 2 (Quantum mutual information bounds classical mutual 
information). For any density matrix Q12 on 7ii ® 7i2, and any partition 
of unity as in \8^) . 

S[qi] + S[q2] - S[qi2] > S%i] + S%2] - S%i2]. (34) 

Note that the right side of (|34j) is just the classical mutual information, 
since Tn P'^gi = Tri2 P"Q^f?i2. 

Proof. We learn from Corollary ^ that 

S[0i2]-S[gi]<^n''S[Q^], (35) 

a 

where n" = Tri2 P"^>i2 and Q2 = Tri P°^>i2/n°. On C*^ TL2, define a 
density matrix gi2 as 

?i2 = 0n"^f. (36) 

a 

Then the right side of H35|) can be written as 

Y.n''S[e^] = S[ei2]-S%i]. (37) 

a 

Note that ^2 = Tri Q12 = n'^g'^ = g2- 

We now apply inequality Corollary ^ again, this time to the expression 
'S'[^i2] — S[g2]. This yields 

S[gi2] - S[g2] = S[gi2] - S[g2] < 5^m^5[ef], (38) 

/3 
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with = TvuQ^Qu and = Ti2Q^ 012/^^ ■ The right side of this 
expression equals 

Y,^^S[t\=S%i2\-S%2]. (39) 

In combination (|35|) - (|39j) give the desired result. □ 

Another way to interpret the results above is the following: define a 
partially classical and partially quantum entropy by 

5-l'Q[f,i2] = - ^ Tr2 (Tn P"^>i2) In (Tn P^qi2). (40) 
a 

Then Corollary ^ and the proof of Corollary |^ show that 

S[Q12]- S[qi]- S[q2] < S'''^%i2]-S%i]-S[Q2] 

< S%i2]-S%i]-S%2]. (41) 

In the same way, Theorem 21 in the special case where the -fC" act non- 
trivially only on Tii, can be interpreted as 

S[qi2z\ - S[qi2] < ^^^•^''^[f^m] - S''^%i2] (42) 

for a partition of unity on Tii, with the obvious definition of S"^''Q'Q. We 
can use this inequality to prove the following. 

Corollary 3 (Convexity of classical minus quantum entropy). The 

map 

^12^S^'''^M-%12] (43) 

is convex. 



Proof. Let A12 and B12 be two density matrices on Hi® 712- On ® 
C^, consider the density matrix g>i23 = |^i2 •K' H + ^-612 (I — H), where 11 
is a one-dimensional projection in C^. Inequality (|42() implies that 

\S[Ai2\ + \S[Bi2\ - S[qi2\ < i5^''^[^i2] + i5^''^[Si2] - S'''%i2]. (44) 
Since Q12 = 5^12 + k^u, this proves the result. □ 
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Remark. In particular, taking 7i2 = C to be trivial, this corollary shows 
that the map (for a single Hilbert space) 

Q^S%]-S[e] (45) 

is convex - which is remarkable, given that both entropies are concave func- 
tions of Q. The inequality implied by convexity of (|45l) is known as the 
Holevo bound ^^^7\. 



3.2 Coherent States and Wehrl Entropy 

Now, suppose we are given a coherent state decomposition of both TCi and 
TC2, i.e., normalized vectors \ip) G TCi, \9) G 7^2 and positive measures /x and 
on some measure space (not necessarily the same spaces) such that 



dfi{ip)\ip){ip\=h , / di^{e)\e){e\ = h. (46) 



Here, is the Dirac notation for the one-dimensional projector onto 

ip and the integrals are to be interpreted in the weak sense, as explained 
before. 

The classical (Wehrl) entropy ^ for a density matrix gi2 on Tii Ti.2 is 
then defined as 

S'^[qi2] = -J dfi{^)di^{e){^,6\gi2\y^,e)ln{ip,9\Qi2\ip,9), (47) 

and similarly for density matrices on TCi or 7i2- As Wehrl showed, it follows 
from concavity of x i— > —xlnx that 

S[0i2] < S'^iQu]. (48) 

Corollaries 121 and El now also hold for the Wehrl entropy. In particular, 
it is true that 

S'^lQl] + S'^[q2] - S'^iQu] < S[0i] + S[02] - S[0i2]. (49) 

Moreover, Corollary El implies that the map 

g^S'^iej-Sig] (50) 
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is convex. Note that the infimum of this function is zero. In the finite 
dimensional case, this infimum is achieved for the totally mixed state q = 
d~^I, where d = diuiTi. 

It might be recalled that Wehrl raised the question ^ of evaluating the 
minimum of his classical entropy and conjectured, in the special case of the 
Glauber coherent states, that it should be given by the one-dimensional 
projector onto a coherent state g = \0){9\. (The minimum of the quantum 
entropy, — Tr ^In g, is always zero.) This particular conjecture was proved in 
[TH] , where the (still open) generalized conjecture was made to SU{2) (Bloch) 
coherent states. Oddly, the minimum of the difference of the entropies is a 
much easier question to answer. 

Note that because of convexity the maximum of the function S'^[^>] — S'[f3] 
is attained for a pure state, where S[q\ = 0. Hence the question about the 
maximum of S^[Q\ — S[g\ is equivalent to maximizing S^[g\ over pure states. 
In the case of the Glauber coherent states, this maximum is infinite [Jj. 

3.3 Quantum Statistical Mechanics of Point Particles 

Consider a system of two types of particles, A and B. The state space of 
the combined system is H = TLa ®'Hb-, where TLa and TLb are the spaces of 
square integrable functions of the particle configurations of particles A and 
B, respectively. We assume that the configuration space is M*^-^ and R'^^, 
respectively, for some finite dA and ds- The usual subadditivity of entropy 
implies that, for any state g on TC, 



where qa and gs denote the states of the subsystems. However, in applica- 
tions it can be useful to get a lower bound not only on the entropy oi qb, 
which is the state averaged over all configurations of the A particles, but 
rather on the average entropy of the state for fixed A particles. Such a bound 
is one of the key ingredients in a rigorous upper bound on the pressure of a 
dilute Fermi gas at non-zero temperature 

More precisely, if Xa and Xb denote particle configurations of the A 
and B particles, any density matrix on Tl will be given by an integral kernel 
g{XA, Xb] X'^, X'q). For every fixed configuration of the A particles, Xa, 




(51) 
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we can then define a density matrix on TCb by the kernel 

g^^{XB,X's) = n{XAr' q{Xa,Xb;Xa,X's), (52) 
where u^Xa) is the normalization factor 

n{XA) = j dXBQ{XA,XB;XA,XB). (53) 

Since is a trace class operator, g^"^ is well defined by the spectral de- 
composition of Q for almost every Xa, if n{XA) 7^ 0. The definition (|52)) 
makes sense only if n(X^) > 0; only in this will be needed below, 

however. 

Note that n[X a) is the probability density of a configuration of A parti- 
cles Xa- Moreover, j dXAn{XA) = 1, and / dXAn{XA) Qb^ = Qb- There- 
fore we have, by concavity of S[g], 

S[qb] > J dXAn{XA)S[g^^]. (54) 

Hence the following is a strengthening of (|51j) . 

Corollary 4 (Subadditivity with Average Entropy instead of En- 
tropy of Average). Let g be a density matrix on TCa ®'Hb with finite 
entropy. With the definitions given above, 

j dXAn{XA) S[gl^] > S[g] - S[gA]. (55) 

We remark that it is not necessary to have a fixed particle number for 
this bound; the integral J dXA can as well include a discrete sum over dif- 
ferent particle numbers. I.e., our bound also applies to the grand-canonical 
ensemble, and this is the form that Corollary |3 actually gets used in jl9j . 
For simplicity, we consider only the case of a fixed particle number in the 
proof below, but the extension is straightforward, using an additional de- 
composition I = J2n>o ^rii where P„ projects onto the subspace of TIa with 
fixed particle number n. 

Corollary follows from Corollary ^ by the following limiting argument. 

Proof of Corollary^ For d = dA, let j : M'' M be a positive and integrable 
function on the configuration space of the A particles, with J dX j{X) = 1. 
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For some e > and F G M'^, let jj {X) = e-'^j{{X - Y)/e). Let 

P' = l- ( dY\j^){j^\. (56) 

It is not difficult to see that > 0. Hence we can infer from Corollary ^ 
that, for any density matrix Q12 on "Hi (8)7^2 (where we use again the notation 
1 and 2 instead of A and B), 

S[qi2\-S[qi]< I dYne{Y)S[Q'',]+np.S[Q^'l (57) 

where we denoted ne{Y) = {jJ\Qi\j^), = (j^|£>i2| j^)/ne(y), npe = 
TiiP^ Qi and = TviP^ Qi2/npe . We will show that there exists a sequence 
Ej with £j — > as j ^ CO such that the right side of (|57j) converges to the 
left side of (|55|) in the limit j ^ 00. 

We note that, for any square integrable function (f) on W^, — > (t>{Y) 
strongly in L'^{M.'^) as e — > 21, Thm. 2.16]. Passing to a subsequence, it 
is then true that {jY\(p) (PO^) almost everywhere. Decomposing qi into 
its eigenvalues and eigenfunctions, we see that there is a subsequence such 
that lim£_>o ?^£(^) = n(Y) for almost every Y. Also, weakly as 

e — > for a.e. Y . (Here we used the separability of the Hilbert space to 
ensure the existence of this subsequence.) Since also the traces converge, 
this convergence is actually in trace class norm |2nj . 

We first assume that Q12 has finite rank. Then also and g^ have finite 
rank, and hence bounded entropy. It follows that lim£_^o 'S'[^?^] = 
Moreover, it is easy to see that ^ weakly as e — > 0. This implies that 
lime^o^-P^ = 0, and hence lim^^o / dY UeiY) = 1. It then follows from 
Fatou's Lemma that 

lim / dYne(Y)S[gJ]= [ dYn{Y)S[gl]. (58) 

'^^O jR'i JRd 

It remains to show that the last term in ()57() goes to zero as e ^ 0. As 
already noted, lime^o^^P^ = 0. The entropy 5'[i3|'^] need not be bounded 
as e — > 0, however. Since < I, TriP^^i2 < gs- Note that gi2 has finite 
entropy by assumption and, without loss of generality, also gA has finite 
entropy. This implies that gB has finite entropy by the triangle inequality for 
entropies ^22^. Hence it follows from dominated convergence 1, Thm. A3] 
that S[TtiP^ gi2] — > as e — > 0, and hence np6S[g2^] = S'[TriP'^^>i2] + 
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npe In npe — > as e — > 0. This proves in the case that gi2 has finite 
rank. 

For a general gi2 on Tii (8) ^2, let ^2 = Q12, where denotes the 
projection onto the largest j eigenvalues of gi2- It is then easy to see that 

S[qi2] = Mm Siffi^] (59) 

and 

S[0a] = lim S[Tt2 ^[2] (60) 

(cf. the Appendix in P). Moreover, with n^{Y) and ^ defined as above, 
for the operator ^[21 write 



dYn^{Y)S[glj] = e j dV niV) S [g^ {nHV) / e n{Y))] 

dY n^{Y) In {en{Y)/n^{Y)). (61) 



Note that n^{Y) is pointwise increasing in j, and also n^{Y)g^j is an in- 
creasing sequence of operators. Moreover, g^ j{n^ {Y)/en{Y)) < 1/e. Since 
— xlnx is monotone increasing for < x < 1/e, this implies that the first 
term on the right side of (|61j) is bounded from above by 

e j dYn{Y)S[g^/e] = j dYn{Y)S[gl] + J dYn{Y). (62) 

Moreover, since limj^oo 'T-"' (^) = n{Y) for almost every Y, this implies, by 
monotone convergence, 



lim — / dY n-' 



' (Y) In (e n{Y) jn^ (Y)) = - j dY n{Y) . (63) 



This shows that H55() also holds in the infinite rank case, and finishes the 
proof of Corollary [l] □ 



A Extension to Integrals 

In this appendix we extend Theorem |21 in the following way. Let again TCi be 
finite dimensional Hilbert spaces, 1 < i < 3. Let be a measure space, with 
elements labeled by a, and let d^{a) be a measure on Vt. Let K'^ be matrices 
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on Til (E> 'H2 that are weakly measurable (i.e., all the matrix elements are 
measurable functions), such that 

/ dfiia)K''* K'^ = h2. (64) 

The extension of Theorem |2l is the following. With the same definitions as 
in ((TT|)-(P|). 

S[gi23] - SiQu] < [ df^ia) {S[q%] - S[q^]) . (65) 
Jn 

Note that this expression is well defined, since the integrand is measurable 
and the entropy is bounded. 

For the proof of ()65|) . we may assume that, for each a, < 1. 

This is no restriction, since we can always absorb the norm into the measure 
dji{a). Likewise, we may assume that TyK°'*K°' > 1/2. Taking the trace 
of (|64|) . it is then clear that Q. has finite measure. 

Pick some e > 0. By looking at the level sets of the matrix elements 
of X", we can divide into finitely many disjoint measurable subsets Oj, 
1 < j < with - Kl^W < e if a and /? are in the same subset Oj. For 
each a G ri, write K"" = Lf^iK"^* K")^/^ , with [/" unitary. For each j, pick 
some aj £ Oj, and define 

/ \ 1/2 

V = U''^lj dfi{a)K''*K''] . (66) 

We then have '^jU*U = I12, and hence we can apply Theorem [2 That 
is, we have, for any density matrix qi2z on Hi ® Ti.2 Ti^, 

S[gi23] - S[gi2] < {^1(^23] - Siei]) , (67) 

j 

with = Tri23L^* Q123 and ^23 = Th:iU Q123LP* /riK We will now show 
that, as e ^ 0, the right side of (|H7|) converges to the right side of 

Using the fact that \\^fA — ^fB\\ < \\A — BW^^"^ for any two positive 
matrices A and B 1231 Eq. X.2], we can estimate 

\\L^ - /2^"^ ||2 < f (in{a)\\K'^* - < 2£\Oj\. (68) 

7o, 
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Here, | • | denotes the measure of a subset of il, and we used \\K"\\ < 1 and 
\\K" — K"^ II < e in the last step. Using the triangle inequahty we thus see 
that \\U - I 1 ^"11 < \Oj\'^/'^{e + V2e) for any a€Oj. This implies that 

WL^gusL^* - \Oj\ /^"^i23K"1| < |0,|2(e + V2^)\\qi23\\ (69) 

for any a € Oj, where we again used that Hi^'^H < 1- Note that (|69|) imphes 
the same estimate for the partial trace of the operator on the left side, with 
the right side multiplied by the dimension of the space. Moreover, since in 
finite dimensions the entropy is Lipschitz continuous, this also implies that, 
for some constant C > 0, 

j 3 

f dfi{a) (s[Tti K°^i23i^°*] - S[Tri2 K'^gusK'^*] + C{e + V2^)) 

= / dfiia)n"{S[Q^,]-S[g^])+Cme + V2^). (70) 
Letting e ^ this proves the desired result. 



B Extension to Infinite Dimensional Spaces 

We now show that ()65() holds in the case when the TCi are separable, possibly 
infinite dimensional, Hilbert spaces. This will prove Theorem ^ As stated 
there, we assume that the K°' are bounded, weakly measurable operators, 
i.e, {(p\K°'\(l)) is measurable for all vectors \(p). (By polarization, this implies 
that the matrix elements {(j)\K"\ip) are measurable for any vectors \<j)) and 
1^).) We also assume (jlj to hold. Again it is then no restriction to assume 
that ||i^"*ii'"|| < 1 for each a G Q. Note that 17 need not have finite 
measure in this case, however. We assume that the density matrix ^>i23 has 
finite entropy. We may also assume that /S'[^>i2] is finite, otherwise there is 
nothing to prove. Note that in this case, the triangle inequality for entropy 
implies that also Slg^] is finite. 

For m S N, let pj^"^'^ be rank m projections in Tli, such that P^"^^ — > I^. 
strongly as m ^ cx). Let P = P^'^'^ ® P^"''^ ® P^'^\ Then also P I123 
strongly as m — > 00. In the following, we find it convenient to suppress the 
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dependence on m in our notation, but rather put a ^ on all quantities that 
depend on m. 

Given a density matrix ^1123, we define ^123 = PQi2iP- Moreover, let 
Jl^ = p['^^ pM^apM ^ pbn) for a G rj. Let also 



L 



p(m) ^ p(m) j ^^^^^ ^a*p(m) ^ pM^a^ pM ^ pjjn) 



If we set Hi = P-"^^TLi, we then have 



1/2 
(71) 



/ dfi{a)K^*K^ + L'' = I^^^, (72) 
and hence we can apply the finite dimensional result (|65|) . I.e., 

S[Qi23]-S[gi2]< I dMa)n-(5[^3]-5[^])+n^(5[^y-5[^^]). (73) 



n 

Here, n" and the density matrices ^3 and ^ are defined as in ((TT|) - ()13|) . 
with K'^ replaced by and ^5123 replaced by ^123- Moreover, 5^23' S2 ^^^^ 

are defined in the same way, with L in place of K°^. Our goal is to show 
that we can remove the ^'s in ((75|) . 

Note that ^123 — > i?i23 strongly as m ^ 00. Since also the trace of ^123 
converges to Trf)i23 = 1, this implies that the convergence is actually in trace 
norm, as proved by Wehrl in |^. Hence also Q12 gi2 in trace norm. Since 
the eigenvalues of ^123 are smaller than the corresponding eigenvalues of ^123, 
i.e., ^123 <|; £'123 in Simon's notation in the appendix of 1 , Theorem A2 in 

implies that 

lim 5'[^i23] = 5'[£ii23]- (74) 

m^oo 

By the same reasoning, this also holds for qi2. Taking the limit m ^ 00 in 
(|73p . we thus have 

S[qi23] - S[qi2] < liminf / dfi{a) {Am{a) + Bm{a)) 

+ liminf n^(5[?y-5[?i']). (75) 

m— >oo 

Here we defined the functions Am{a) and Bm{a) by 

AM = n"(5[^3] - S[^] - S[^]) (76) 
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and 

i?„(a) = n"5[g], (77) 

with = Tr2 ^3 = Tri2 K'^*K'^Qi23/n°' . The reason for the sphtting into 
the two parts H76() and ()77() is that Am{a) is negative, which allows for the 
use of Fatou's Lemma, whereas Bmia) depends on K°' only through K°'*K°'. 

We start by estimating the last term on the right side of H75|) . By 
subadditivity of entropy, ^[^gs] ~ ^[02] ^ S[q^], with = Tr2523- We 
claim that limm^oo = 0- This is true if we can show that 

lim f dii{a)Tn2zPK''*PK^PQi2Z = l. (78) 

Since PK"* PK^'P converges strongly to K"*K" for each fixed a (because 
products of strongly convergent sequences converge strongly), we see that 
Tti23PK°'* P P Q123 converges to T1C123K"* Q123 for each fixed a. Hence, 
using Fatou's Lemma, we see that the left side of (|78|) is always > 1. On 
the other hand, estimating the middle P in the integrand in H78|) by P < 1 
and using @, we see that the left side of ((75|) is bounded above by 

lim Tri23 -P£'i23 = Tri23 0123 = 1- (79) 

m— »oo 

This proves the claim that limm^oo = 0. 

Although S[g^] need not be bounded, we claim that n^S[Q^] — > as 
m — > cxD. To see this, write n^S[g^] = 5'[Tri2i^^^i23] +n^lnn^. Note that 
the second term goes to zero as 0. Since < I12, Tri2i^^^i23 < 

-^3™^ ^3-^3™^ <t 03- Recall that S[q3] is finite. Since Tri2-L^^i23 ^ in 
trace norm as m ^ 00, we can use dominated convergence Thm. Al] to 
conclude that S'[Tri2-L^^i23] ^ as m — > 00. Hence we have shown that 

limsupn^(S[^y-5[^^]) <0. (80) 

Next we treat the first term on the right side of ()75() . i.e., the integral of 
Am{oi)- Since Am{oi) < (by subadditivity), we can use Fatou's Lemma to 
estimate 



liminf / dfj,{a) Am{a) < / (i/u(a) limsup vl^ 

Jn Jn m-*oo 



(«)• (81) 



a 



We now claim that ^3 g^^^ in trace norm. It is clear that i^"*^i23i^' 
converges strongly to K°^* gi23K°^ . By the same argument as that after (|78|). 
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the trace also converges, and hence the convergence is in trace norm. This 
imphes that the reduced density matrices also converge in trace norm, and 
hence proves our claim. Moreover, we claim that, for each fixed a, 

hmsup AM < n'^iSlQ^s] " S[q^] - S[q^]). (82) 

m^oo 

This follows from upper semicontinuity of 5[^>23] — S[g2] — S[q3] in ^23- This 
upper semicontinuity, in turn, follows from lower semicontinuity of the rela- 
tive entropy H{g, a) = Tr g{ln g — lna) ^1 2.2,22(ii)] , since 5'[g23] — S[g2] — 
Sig-i] = —H{g23, g2 gs)- By combining (|ST|l and we have thus shown 
that 

liminf [ dfi{a)A^{a)< [ dfi{a) n"" (S[g^.] - S[g^] - S[g?]) . (83) 
"^"c Jn Jn 

It remains to show that 

liminf / d^x{a) Bra{a) < f dn{a) S[g?], (84) 
Jn Jn 

with 

Bm{a) = = 5[Tri2i?^*i?^?i23] +^"lnn". (85) 

Note that Tria ^*^?i23 < Pf^gsPf^i gs, since \\K°'*K''\\ < 1 and 
hence llEr"*!^"!! < 1. By the same argument as above, Tri2 K°^*K°'gi23 con- 
verges to Tri2 -ftr"*^"^i23 in trace norm, and hence, again by Theorem Al 
inP, 

lim Bm{a) = (86) 

m^oo 

for each fixed a. This gives pointwise convergence, but to show conver- 
gence of the integral in ()75() we have to use the dominated convergence 
theorem. We claim that Bm{ct) is uniformly bounded, independent of m 
and a. This is true since —xlnx is monotone in x for < x < 1/e. Since 
Tri2 K°^*K°^gi23 <^ g3, the contribution to the entropy from eigenvalues less 
then 1/e is bounded by the corresponding value for ^53. Moreover, since 
Tr £>3 = 1, there are at most 2 eigenvalues bigger than 1/e. This gives the 
claimed uniform bound. 

By dominated convergence, we see that, for any subset O C Q with finite 
measure, 



m— >oo 



Jim / dfi{a) B„^{a) = / dfi{a) n"" S[g1] < / dfi{a) n"" S[g1]. (87) 
'o Jo Jn 
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Moreover, by concavity of ^[i)], 

/ d^x{a)Bm{a)<( I dii{a)nAs[ri (88) 

JO'' XJO" J 

with 

Tri2 / d^i{a)K^*K^Qi2^. (89) 
JO" 

Proceeding as in the proof of H78() . we see that 

hm / dn{a)n°'= / diji{a)n". (90) 

Also Tri2 Jcje dn{a)K^*'K^gi23 Tri2 f^, dn{a)K'^* K'^ qi2z weakly, and 
thus in trace norm. Since again Tri2 d^{a)K°'* K°'qi2z <t Qz-, the same 
argument as above implies that the right side of (|88|) converges, in the limit 
m — > oo, to the corresponding expression without the ^'s. I.e., 

limsup / d^i{a) Bm{a) < ( I dfi{a)n'^] S[q% (91) 

m^oo JO" yJo" J 

with q'^ given as in 1)89(1 . but with all the ^ removed. 

Now as O — > fi, diJ,{a) n°' — > 0. Using again the dominance by £)3, 
which follows from the fact that J^c dfi{a) K"*K'^ < I, we see the right side 
of ()9ip goes to zero as O ^ $7. Together with ()87() . this finishes the proof 
of (|84|) . and hence the proof of the theorem. 



dn{ 



a n 



QC 



C Ruskai's proof of Corollary [2] 

In an early version of this paper we had a weaker version of Corollary |21 
which read S[qi2] — S[qi] < S^\qi2\ — <S"^'[^>i]. In a private correspondence, 
M.B. Ruskai suggested the stronger version for the case of (product) coherent 
states, and her suggestion motivated us to prove the strengthened version 
of Corollary 121 using our methods. The following is a sketch of her proof. 

1. ) The map from a state to to its coherent state representation is a 
completely-positive, trace-preserving map (CPT). 

2. ) The relative entropy of two density matrices, H{g,ij,) = Tr g{lnQ — 
In /u) is known to decrease under CPT maps. 

3. ) Apply 2.) to H{qi2, Qi (X" Q2) using the (product) coherent state map, 
to obtain 
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